Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 17379 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 7.4 MiB |
| Average record size in memory | 449.0 B |
Variable types
| NUM | 11 |
|---|---|
| CAT | 7 |
| BOOL | 3 |
Reproduction
| Analysis started | 2020-04-02 08:59:07.238055 |
|---|---|
| Analysis finished | 2020-04-02 08:59:51.622430 |
| Version | pandas-profiling v2.5.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
dteday has a high cardinality: 731 distinct values | High cardinality |
atemp is highly correlated with temp | High Correlation |
temp is highly correlated with atemp | High Correlation |
cnt is highly correlated with registered | High Correlation |
registered is highly correlated with cnt | High Correlation |
season_int is highly correlated with season | High Correlation |
season is highly correlated with season_int | High Correlation |
weathersit_int is highly correlated with weathersit | High Correlation |
weathersit is highly correlated with weathersit_int | High Correlation |
dteday only contains datetime values, but is categorical. Consider applying pd.to_datetime() | Type |
hr has 726 (4.2%) zeros | Zeros |
windspeed has 2180 (12.5%) zeros | Zeros |
casual has 1581 (9.1%) zeros | Zeros |
weekday_int has 2502 (14.4%) zeros | Zeros |
| Distinct count | 17379 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8690 |
|---|---|
| Minimum | 1 |
| Maximum | 17379 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 135.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 869.9 |
| Q1 | 4345.5 |
| median | 8690 |
| Q3 | 13034.5 |
| 95-th percentile | 16510.1 |
| Maximum | 17379 |
| Range | 17378 |
| Interquartile range (IQR) | 8689 |
Descriptive statistics
| Standard deviation | 5017.0295 |
|---|---|
| Coefficient of variation (CV) | 0.5773336593 |
| Kurtosis | -1.2 |
| Mean | 8690 |
| Median Absolute Deviation (MAD) | 4344.749986 |
| Skewness | 0 |
| Sum | 151023510 |
| Variance | 25170585 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.0000e+00 1.7379e+04], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 4727 | 1 | < 0.1% | |
| 12947 | 1 | < 0.1% | |
| 14994 | 1 | < 0.1% | |
| 8849 | 1 | < 0.1% | |
| 10896 | 1 | < 0.1% | |
| 17037 | 1 | < 0.1% | |
| 4743 | 1 | < 0.1% | |
| 6790 | 1 | < 0.1% | |
| 645 | 1 | < 0.1% | |
| Other values (17369) | 17369 | 99.9% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 17379 | 1 | < 0.1% | |
| 17378 | 1 | < 0.1% | |
| 17377 | 1 | < 0.1% | |
| 17376 | 1 | < 0.1% | |
| 17375 | 1 | < 0.1% |
| Distinct count | 731 |
|---|---|
| Unique (%) | 4.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 135.9 KiB |
| 2012-09-21 | 24 |
|---|---|
| 2012-12-11 | 24 |
| 2012-08-04 | 24 |
| 2011-12-10 | 24 |
| 2012-11-04 | 24 |
| Other values (726) |
| Value | Count | Frequency (%) | |
| 2012-09-21 | 24 | 0.1% | |
| 2012-12-11 | 24 | 0.1% | |
| 2012-08-04 | 24 | 0.1% | |
| 2011-12-10 | 24 | 0.1% | |
| 2012-11-04 | 24 | 0.1% | |
| 2012-01-09 | 24 | 0.1% | |
| 2012-03-16 | 24 | 0.1% | |
| 2011-12-18 | 24 | 0.1% | |
| 2012-08-09 | 24 | 0.1% | |
| 2012-09-13 | 24 | 0.1% | |
| Other values (721) | 17139 | 98.6% |
Length
| Max length | 10 |
|---|---|
| Mean length | 10 |
| Min length | 10 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 10 | 90.9% | |
| Dash_Punctuation | 1 | 9.1% |
| Value | Count | Frequency (%) | |
| Common | 11 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 11 | 100.0% |
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 135.9 KiB |
| Summer | |
|---|---|
| Spring | |
| Winter | |
| Fall |
| Value | Count | Frequency (%) | |
| Summer | 4496 | 25.9% | |
| Spring | 4409 | 25.4% | |
| Winter | 4242 | 24.4% | |
| Fall | 4232 | 24.4% |
Length
| Max length | 6 |
|---|---|
| Mean length | 5.51297543 |
| Min length | 4 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 11 | 78.6% | |
| Uppercase_Letter | 3 | 21.4% |
| Value | Count | Frequency (%) | |
| Latin | 14 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 14 | 100.0% |
yr
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 135.9 KiB |
| 1 | |
|---|---|
| 0 |
| Value | Count | Frequency (%) | |
| 1 | 8734 | 50.3% | |
| 0 | 8645 | 49.7% |
month
Categorical
| Distinct count | 12 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 135.9 KiB |
| May | 1488 |
|---|---|
| July | 1488 |
| December | 1483 |
| August | 1475 |
| March | 1473 |
| Other values (7) |
| Value | Count | Frequency (%) | |
| May | 1488 | 8.6% | |
| July | 1488 | 8.6% | |
| December | 1483 | 8.5% | |
| August | 1475 | 8.5% | |
| March | 1473 | 8.5% | |
| October | 1451 | 8.3% | |
| June | 1440 | 8.3% | |
| September | 1437 | 8.3% | |
| November | 1437 | 8.3% | |
| April | 1437 | 8.3% | |
| Other values (2) | 2770 | 15.9% |
Length
| Max length | 9 |
|---|---|
| Mean length | 6.142873583 |
| Min length | 3 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 18 | 69.2% | |
| Uppercase_Letter | 8 | 30.8% |
| Value | Count | Frequency (%) | |
| Latin | 26 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 26 | 100.0% |
| Distinct count | 24 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.54675183 |
|---|---|
| Minimum | 0 |
| Maximum | 23 |
| Zeros | 726 |
| Zeros (%) | 4.2% |
| Memory size | 135.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 6 |
| median | 12 |
| Q3 | 18 |
| 95-th percentile | 22 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 6.914405095 |
|---|---|
| Coefficient of variation (CV) | 0.5988181957 |
| Kurtosis | -1.198020588 |
| Mean | 11.54675183 |
| Median Absolute Deviation (MAD) | 5.988232784 |
| Skewness | -0.01067990952 |
| Sum | 200671 |
| Variance | 47.80899782 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 22.5 23. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 16 | 730 | 4.2% | |
| 17 | 730 | 4.2% | |
| 15 | 729 | 4.2% | |
| 13 | 729 | 4.2% | |
| 14 | 729 | 4.2% | |
| 22 | 728 | 4.2% | |
| 18 | 728 | 4.2% | |
| 19 | 728 | 4.2% | |
| 20 | 728 | 4.2% | |
| 21 | 728 | 4.2% | |
| Other values (14) | 10092 | 58.1% |
| Value | Count | Frequency (%) | |
| 0 | 726 | 4.2% | |
| 1 | 724 | 4.2% | |
| 2 | 715 | 4.1% | |
| 3 | 697 | 4.0% | |
| 4 | 697 | 4.0% |
| Value | Count | Frequency (%) | |
| 23 | 728 | 4.2% | |
| 22 | 728 | 4.2% | |
| 21 | 728 | 4.2% | |
| 20 | 728 | 4.2% | |
| 19 | 728 | 4.2% |
holiday
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 135.9 KiB |
| 0 | |
|---|---|
| 1 | 500 |
| Value | Count | Frequency (%) | |
| 0 | 16879 | 97.1% | |
| 1 | 500 | 2.9% |
weekday
Categorical
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 135.9 KiB |
| Saturday | |
|---|---|
| Sunday | |
| Friday | |
| Monday | |
| Wednesday | |
| Other values (2) |
| Value | Count | Frequency (%) | |
| Saturday | 2512 | 14.5% | |
| Sunday | 2502 | 14.4% | |
| Friday | 2487 | 14.3% | |
| Monday | 2479 | 14.3% | |
| Wednesday | 2475 | 14.2% | |
| Thursday | 2471 | 14.2% | |
| Tuesday | 2453 | 14.1% |
Length
| Max length | 9 |
|---|---|
| Mean length | 7.14183785 |
| Min length | 6 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 12 | 70.6% | |
| Uppercase_Letter | 5 | 29.4% |
| Value | Count | Frequency (%) | |
| Latin | 17 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 17 | 100.0% |
workingday
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 135.9 KiB |
| 1 | |
|---|---|
| 0 |
| Value | Count | Frequency (%) | |
| 1 | 11865 | 68.3% | |
| 0 | 5514 | 31.7% |
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 135.9 KiB |
| Clear | |
|---|---|
| Misty Cloudy | |
| Light Snow | 1419 |
| Thunderstorm | 3 |
| Value | Count | Frequency (%) | |
| Clear | 11413 | 65.7% | |
| Misty Cloudy | 4544 | 26.1% | |
| Light Snow | 1419 | 8.2% | |
| Thunderstorm | 3 | < 0.1% |
Length
| Max length | 12 |
|---|---|
| Mean length | 7.239714598 |
| Min length | 5 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 16 | 72.7% | |
| Uppercase_Letter | 5 | 22.7% | |
| Space_Separator | 1 | 4.5% |
| Value | Count | Frequency (%) | |
| Latin | 21 | 95.5% | |
| Common | 1 | 4.5% |
| Value | Count | Frequency (%) | |
| ASCII | 22 | 100.0% |
| Distinct count | 50 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4969871684 |
|---|---|
| Minimum | 0.02 |
| Maximum | 1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 135.9 KiB |
Quantile statistics
| Minimum | 0.02 |
|---|---|
| 5-th percentile | 0.2 |
| Q1 | 0.34 |
| median | 0.5 |
| Q3 | 0.66 |
| 95-th percentile | 0.8 |
| Maximum | 1 |
| Range | 0.98 |
| Interquartile range (IQR) | 0.32 |
Descriptive statistics
| Standard deviation | 0.1925561212 |
|---|---|
| Coefficient of variation (CV) | 0.3874468668 |
| Kurtosis | -0.9418442041 |
| Mean | 0.4969871684 |
| Median Absolute Deviation (MAD) | 0.1651747656 |
| Skewness | -0.006020883348 |
| Sum | 8637.14 |
| Variance | 0.03707785983 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.02 0.09 0.13 0.15 0.17 ... 0.83 0.87 0.93 0.97 1. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 0.62 | 726 | 4.2% | |
| 0.66 | 693 | 4.0% | |
| 0.64 | 692 | 4.0% | |
| 0.7 | 690 | 4.0% | |
| 0.6 | 675 | 3.9% | |
| 0.36 | 671 | 3.9% | |
| 0.34 | 645 | 3.7% | |
| 0.3 | 641 | 3.7% | |
| 0.4 | 614 | 3.5% | |
| 0.32 | 611 | 3.5% | |
| Other values (40) | 10721 | 61.7% |
| Value | Count | Frequency (%) | |
| 0.02 | 17 | 0.1% | |
| 0.04 | 16 | 0.1% | |
| 0.06 | 16 | 0.1% | |
| 0.08 | 17 | 0.1% | |
| 0.1 | 51 | 0.3% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 0.98 | 1 | < 0.1% | |
| 0.96 | 16 | 0.1% | |
| 0.94 | 17 | 0.1% | |
| 0.92 | 49 | 0.3% |
| Distinct count | 65 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4757751021 |
|---|---|
| Minimum | 0 |
| Maximum | 1 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Memory size | 135.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.2121 |
| Q1 | 0.3333 |
| median | 0.4848 |
| Q3 | 0.6212 |
| 95-th percentile | 0.7424 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.2879 |
Descriptive statistics
| Standard deviation | 0.1718502156 |
|---|---|
| Coefficient of variation (CV) | 0.3612005228 |
| Kurtosis | -0.8454118948 |
| Mean | 0.4757751021 |
| Median Absolute Deviation (MAD) | 0.1453236352 |
| Skewness | -0.09042885856 |
| Sum | 8268.4955 |
| Variance | 0.02953249661 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.0682 0.11365 0.17425 0.20455 ... 0.79545 0.82575 0.85605 0.9015 1. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 0.6212 | 988 | 5.7% | |
| 0.5152 | 618 | 3.6% | |
| 0.4091 | 614 | 3.5% | |
| 0.3333 | 600 | 3.5% | |
| 0.6667 | 593 | 3.4% | |
| 0.6061 | 588 | 3.4% | |
| 0.5303 | 579 | 3.3% | |
| 0.5 | 575 | 3.3% | |
| 0.4545 | 559 | 3.2% | |
| 0.303 | 549 | 3.2% | |
| Other values (55) | 11116 | 64.0% |
| Value | Count | Frequency (%) | |
| 0 | 2 | < 0.1% | |
| 0.0152 | 4 | < 0.1% | |
| 0.0303 | 8 | < 0.1% | |
| 0.0455 | 9 | 0.1% | |
| 0.0606 | 14 | 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 0.9848 | 2 | < 0.1% | |
| 0.9545 | 1 | < 0.1% | |
| 0.9242 | 5 | < 0.1% | |
| 0.9091 | 5 | < 0.1% |
hum
Real number (ℝ≥0)
| Distinct count | 89 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6272288394 |
|---|---|
| Minimum | 0 |
| Maximum | 1 |
| Zeros | 22 |
| Zeros (%) | 0.1% |
| Memory size | 135.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.31 |
| Q1 | 0.48 |
| median | 0.63 |
| Q3 | 0.78 |
| 95-th percentile | 0.93 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.3 |
Descriptive statistics
| Standard deviation | 0.1929298341 |
|---|---|
| Coefficient of variation (CV) | 0.3075908216 |
| Kurtosis | -0.8261167359 |
| Mean | 0.6272288394 |
| Median Absolute Deviation (MAD) | 0.163311399 |
| Skewness | -0.1112871494 |
| Sum | 10900.61 |
| Variance | 0.03722192087 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.04 0.145 0.185 0.225 ... 0.895 0.925 0.95 0.985 1. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 0.88 | 657 | 3.8% | |
| 0.83 | 630 | 3.6% | |
| 0.94 | 560 | 3.2% | |
| 0.87 | 488 | 2.8% | |
| 0.7 | 430 | 2.5% | |
| 0.66 | 388 | 2.2% | |
| 0.65 | 387 | 2.2% | |
| 0.69 | 359 | 2.1% | |
| 0.55 | 352 | 2.0% | |
| 0.74 | 341 | 2.0% | |
| Other values (79) | 12787 | 73.6% |
| Value | Count | Frequency (%) | |
| 0 | 22 | 0.1% | |
| 0.08 | 1 | < 0.1% | |
| 0.1 | 1 | < 0.1% | |
| 0.12 | 1 | < 0.1% | |
| 0.13 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 270 | 1.6% | |
| 0.97 | 1 | < 0.1% | |
| 0.96 | 3 | < 0.1% | |
| 0.94 | 560 | 3.2% | |
| 0.93 | 331 | 1.9% |
| Distinct count | 30 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1900976063 |
|---|---|
| Minimum | 0 |
| Maximum | 0.8507 |
| Zeros | 2180 |
| Zeros (%) | 12.5% |
| Memory size | 135.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.1045 |
| median | 0.194 |
| Q3 | 0.2537 |
| 95-th percentile | 0.4179 |
| Maximum | 0.8507 |
| Range | 0.8507 |
| Interquartile range (IQR) | 0.1492 |
Descriptive statistics
| Standard deviation | 0.1223402286 |
|---|---|
| Coefficient of variation (CV) | 0.6435653291 |
| Kurtosis | 0.5908204107 |
| Mean | 0.1900976063 |
| Median Absolute Deviation (MAD) | 0.09631231746 |
| Skewness | 0.5749052035 |
| Sum | 3303.7063 |
| Variance | 0.01496713153 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.0448 0.09705 0.1194 0.20895 ... 0.4776 0.5373 0.597 0.67165 0.8507 ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 0 | 2180 | 12.5% | |
| 0.1343 | 1738 | 10.0% | |
| 0.1642 | 1695 | 9.8% | |
| 0.194 | 1657 | 9.5% | |
| 0.1045 | 1617 | 9.3% | |
| 0.2239 | 1513 | 8.7% | |
| 0.0896 | 1425 | 8.2% | |
| 0.2537 | 1295 | 7.5% | |
| 0.2836 | 1048 | 6.0% | |
| 0.2985 | 808 | 4.6% | |
| Other values (20) | 2403 | 13.8% |
| Value | Count | Frequency (%) | |
| 0 | 2180 | 12.5% | |
| 0.0896 | 1425 | 8.2% | |
| 0.1045 | 1617 | 9.3% | |
| 0.1343 | 1738 | 10.0% | |
| 0.1642 | 1695 | 9.8% |
| Value | Count | Frequency (%) | |
| 0.8507 | 2 | < 0.1% | |
| 0.8358 | 1 | < 0.1% | |
| 0.806 | 2 | < 0.1% | |
| 0.7761 | 1 | < 0.1% | |
| 0.7463 | 2 | < 0.1% |
| Distinct count | 322 |
|---|---|
| Unique (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.67621842 |
|---|---|
| Minimum | 0 |
| Maximum | 367 |
| Zeros | 1581 |
| Zeros (%) | 9.1% |
| Memory size | 135.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 4 |
| median | 17 |
| Q3 | 48 |
| 95-th percentile | 138.1 |
| Maximum | 367 |
| Range | 367 |
| Interquartile range (IQR) | 44 |
Descriptive statistics
| Standard deviation | 49.30503039 |
|---|---|
| Coefficient of variation (CV) | 1.382013918 |
| Kurtosis | 7.571001747 |
| Mean | 35.67621842 |
| Median Absolute Deviation (MAD) | 34.13996034 |
| Skewness | 2.499236891 |
| Sum | 620017 |
| Variance | 2430.986021 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 1.5 3.5 5.5 ... 187.5 240.5 275.5 311.5 367. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 0 | 1581 | 9.1% | |
| 1 | 1082 | 6.2% | |
| 2 | 798 | 4.6% | |
| 3 | 697 | 4.0% | |
| 4 | 561 | 3.2% | |
| 5 | 509 | 2.9% | |
| 6 | 448 | 2.6% | |
| 7 | 405 | 2.3% | |
| 8 | 377 | 2.2% | |
| 9 | 348 | 2.0% | |
| Other values (312) | 10573 | 60.8% |
| Value | Count | Frequency (%) | |
| 0 | 1581 | 9.1% | |
| 1 | 1082 | 6.2% | |
| 2 | 798 | 4.6% | |
| 3 | 697 | 4.0% | |
| 4 | 561 | 3.2% |
| Value | Count | Frequency (%) | |
| 367 | 1 | < 0.1% | |
| 362 | 1 | < 0.1% | |
| 361 | 1 | < 0.1% | |
| 357 | 1 | < 0.1% | |
| 356 | 1 | < 0.1% |
| Distinct count | 776 |
|---|---|
| Unique (%) | 4.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 153.7868692 |
|---|---|
| Minimum | 0 |
| Maximum | 886 |
| Zeros | 24 |
| Zeros (%) | 0.1% |
| Memory size | 135.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 34 |
| median | 115 |
| Q3 | 220 |
| 95-th percentile | 465 |
| Maximum | 886 |
| Range | 886 |
| Interquartile range (IQR) | 186 |
Descriptive statistics
| Standard deviation | 151.3572859 |
|---|---|
| Coefficient of variation (CV) | 0.9842016207 |
| Kurtosis | 2.750017757 |
| Mean | 153.7868692 |
| Median Absolute Deviation (MAD) | 114.3961551 |
| Skewness | 1.557904226 |
| Sum | 2672662 |
| Variance | 22909.028 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.000e+00 5.000e-01 2.500e+00 6.500e+00 9.500e+00 ... 4.925e+02 5.515e+02 7.695e+02 8.135e+02 8.860e+02], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 4 | 307 | 1.8% | |
| 3 | 294 | 1.7% | |
| 5 | 287 | 1.7% | |
| 6 | 266 | 1.5% | |
| 2 | 245 | 1.4% | |
| 1 | 201 | 1.2% | |
| 7 | 200 | 1.2% | |
| 8 | 190 | 1.1% | |
| 9 | 178 | 1.0% | |
| 11 | 140 | 0.8% | |
| Other values (766) | 15071 | 86.7% |
| Value | Count | Frequency (%) | |
| 0 | 24 | 0.1% | |
| 1 | 201 | 1.2% | |
| 2 | 245 | 1.4% | |
| 3 | 294 | 1.7% | |
| 4 | 307 | 1.8% |
| Value | Count | Frequency (%) | |
| 886 | 1 | < 0.1% | |
| 885 | 1 | < 0.1% | |
| 876 | 2 | < 0.1% | |
| 871 | 1 | < 0.1% | |
| 860 | 1 | < 0.1% |
| Distinct count | 869 |
|---|---|
| Unique (%) | 5.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 189.4630876 |
|---|---|
| Minimum | 1 |
| Maximum | 977 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 135.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 40 |
| median | 142 |
| Q3 | 281 |
| 95-th percentile | 563.1 |
| Maximum | 977 |
| Range | 976 |
| Interquartile range (IQR) | 241 |
Descriptive statistics
| Standard deviation | 181.3875991 |
|---|---|
| Coefficient of variation (CV) | 0.9573769823 |
| Kurtosis | 1.417203281 |
| Mean | 189.4630876 |
| Median Absolute Deviation (MAD) | 142.3998489 |
| Skewness | 1.277411604 |
| Sum | 3292679 |
| Variance | 32901.4611 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 7.5 11.5 17.5 ... 596.5 693.5 750.5 900.5 977. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 5 | 260 | 1.5% | |
| 6 | 236 | 1.4% | |
| 4 | 231 | 1.3% | |
| 3 | 224 | 1.3% | |
| 2 | 208 | 1.2% | |
| 7 | 198 | 1.1% | |
| 8 | 182 | 1.0% | |
| 1 | 158 | 0.9% | |
| 10 | 155 | 0.9% | |
| 11 | 147 | 0.8% | |
| Other values (859) | 15380 | 88.5% |
| Value | Count | Frequency (%) | |
| 1 | 158 | 0.9% | |
| 2 | 208 | 1.2% | |
| 3 | 224 | 1.3% | |
| 4 | 231 | 1.3% | |
| 5 | 260 | 1.5% |
| Value | Count | Frequency (%) | |
| 977 | 1 | < 0.1% | |
| 976 | 1 | < 0.1% | |
| 970 | 1 | < 0.1% | |
| 968 | 1 | < 0.1% | |
| 967 | 1 | < 0.1% |
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 135.9 KiB |
| 3 | |
|---|---|
| 2 | |
| 1 | |
| 4 |
| Value | Count | Frequency (%) | |
| 3 | 4496 | 25.9% | |
| 2 | 4409 | 25.4% | |
| 1 | 4242 | 24.4% | |
| 4 | 4232 | 24.4% |
Length
| Max length | 1 |
|---|---|
| Mean length | 1 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| Common | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 4 | 100.0% |
month_int
Real number (ℝ≥0)
| Distinct count | 12 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.537775476 |
|---|---|
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 135.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.438775714 |
|---|---|
| Coefficient of variation (CV) | 0.5259855935 |
| Kurtosis | -1.201878197 |
| Mean | 6.537775476 |
| Median Absolute Deviation (MAD) | 2.982815041 |
| Skewness | -0.009253248383 |
| Sum | 113620 |
| Variance | 11.82517841 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 2.5 11.5 12. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 7 | 1488 | 8.6% | |
| 5 | 1488 | 8.6% | |
| 12 | 1483 | 8.5% | |
| 8 | 1475 | 8.5% | |
| 3 | 1473 | 8.5% | |
| 10 | 1451 | 8.3% | |
| 6 | 1440 | 8.3% | |
| 11 | 1437 | 8.3% | |
| 9 | 1437 | 8.3% | |
| 4 | 1437 | 8.3% | |
| Other values (2) | 2770 | 15.9% |
| Value | Count | Frequency (%) | |
| 1 | 1429 | 8.2% | |
| 2 | 1341 | 7.7% | |
| 3 | 1473 | 8.5% | |
| 4 | 1437 | 8.3% | |
| 5 | 1488 | 8.6% |
| Value | Count | Frequency (%) | |
| 12 | 1483 | 8.5% | |
| 11 | 1437 | 8.3% | |
| 10 | 1451 | 8.3% | |
| 9 | 1437 | 8.3% | |
| 8 | 1475 | 8.5% |
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.003682605 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 2502 |
| Zeros (%) | 14.4% |
| Memory size | 135.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.005771456 |
|---|---|
| Coefficient of variation (CV) | 0.6677707733 |
| Kurtosis | -1.255996891 |
| Mean | 3.003682605 |
| Median Absolute Deviation (MAD) | 1.720868973 |
| Skewness | -0.002998221376 |
| Sum | 52201 |
| Variance | 4.023119134 |
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.5 5.5 6. ], "bayesian blocks" binning strategy used)
| Value | Count | Frequency (%) | |
| 6 | 2512 | 14.5% | |
| 0 | 2502 | 14.4% | |
| 5 | 2487 | 14.3% | |
| 1 | 2479 | 14.3% | |
| 3 | 2475 | 14.2% | |
| 4 | 2471 | 14.2% | |
| 2 | 2453 | 14.1% |
| Value | Count | Frequency (%) | |
| 0 | 2502 | 14.4% | |
| 1 | 2479 | 14.3% | |
| 2 | 2453 | 14.1% | |
| 3 | 2475 | 14.2% | |
| 4 | 2471 | 14.2% |
| Value | Count | Frequency (%) | |
| 6 | 2512 | 14.5% | |
| 5 | 2487 | 14.3% | |
| 4 | 2471 | 14.2% | |
| 3 | 2475 | 14.2% | |
| 2 | 2453 | 14.1% |
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 135.9 KiB |
| 1 | |
|---|---|
| 2 | |
| 3 | 1419 |
| 4 | 3 |
| Value | Count | Frequency (%) | |
| 1 | 11413 | 65.7% | |
| 2 | 4544 | 26.1% | |
| 3 | 1419 | 8.2% | |
| 4 | 3 | < 0.1% |
Length
| Max length | 1 |
|---|---|
| Mean length | 1 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| Common | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 4 | 100.0% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
First rows
| instant | dteday | season | yr | month | hr | holiday | weekday | workingday | weathersit | temp | atemp | hum | windspeed | casual | registered | cnt | season_int | month_int | weekday_int | weathersit_int | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 2011-01-01 | Winter | 0 | January | 0 | 0 | Saturday | 0 | Clear | 0.24 | 0.2879 | 0.81 | 0.0000 | 3 | 13 | 16 | 1 | 1 | 6 | 1 |
| 1 | 2 | 2011-01-01 | Winter | 0 | January | 1 | 0 | Saturday | 0 | Clear | 0.22 | 0.2727 | 0.80 | 0.0000 | 8 | 32 | 40 | 1 | 1 | 6 | 1 |
| 2 | 3 | 2011-01-01 | Winter | 0 | January | 2 | 0 | Saturday | 0 | Clear | 0.22 | 0.2727 | 0.80 | 0.0000 | 5 | 27 | 32 | 1 | 1 | 6 | 1 |
| 3 | 4 | 2011-01-01 | Winter | 0 | January | 3 | 0 | Saturday | 0 | Clear | 0.24 | 0.2879 | 0.75 | 0.0000 | 3 | 10 | 13 | 1 | 1 | 6 | 1 |
| 4 | 5 | 2011-01-01 | Winter | 0 | January | 4 | 0 | Saturday | 0 | Clear | 0.24 | 0.2879 | 0.75 | 0.0000 | 0 | 1 | 1 | 1 | 1 | 6 | 1 |
| 5 | 6 | 2011-01-01 | Winter | 0 | January | 5 | 0 | Saturday | 0 | Misty Cloudy | 0.24 | 0.2576 | 0.75 | 0.0896 | 0 | 1 | 1 | 1 | 1 | 6 | 2 |
| 6 | 7 | 2011-01-01 | Winter | 0 | January | 6 | 0 | Saturday | 0 | Clear | 0.22 | 0.2727 | 0.80 | 0.0000 | 2 | 0 | 2 | 1 | 1 | 6 | 1 |
| 7 | 8 | 2011-01-01 | Winter | 0 | January | 7 | 0 | Saturday | 0 | Clear | 0.20 | 0.2576 | 0.86 | 0.0000 | 1 | 2 | 3 | 1 | 1 | 6 | 1 |
| 8 | 9 | 2011-01-01 | Winter | 0 | January | 8 | 0 | Saturday | 0 | Clear | 0.24 | 0.2879 | 0.75 | 0.0000 | 1 | 7 | 8 | 1 | 1 | 6 | 1 |
| 9 | 10 | 2011-01-01 | Winter | 0 | January | 9 | 0 | Saturday | 0 | Clear | 0.32 | 0.3485 | 0.76 | 0.0000 | 8 | 6 | 14 | 1 | 1 | 6 | 1 |
Last rows
| instant | dteday | season | yr | month | hr | holiday | weekday | workingday | weathersit | temp | atemp | hum | windspeed | casual | registered | cnt | season_int | month_int | weekday_int | weathersit_int | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17369 | 17370 | 2012-12-31 | Winter | 1 | December | 14 | 0 | Monday | 1 | Misty Cloudy | 0.28 | 0.2727 | 0.45 | 0.2239 | 62 | 185 | 247 | 1 | 12 | 1 | 2 |
| 17370 | 17371 | 2012-12-31 | Winter | 1 | December | 15 | 0 | Monday | 1 | Misty Cloudy | 0.28 | 0.2879 | 0.45 | 0.1343 | 69 | 246 | 315 | 1 | 12 | 1 | 2 |
| 17371 | 17372 | 2012-12-31 | Winter | 1 | December | 16 | 0 | Monday | 1 | Misty Cloudy | 0.26 | 0.2576 | 0.48 | 0.1940 | 30 | 184 | 214 | 1 | 12 | 1 | 2 |
| 17372 | 17373 | 2012-12-31 | Winter | 1 | December | 17 | 0 | Monday | 1 | Misty Cloudy | 0.26 | 0.2879 | 0.48 | 0.0896 | 14 | 150 | 164 | 1 | 12 | 1 | 2 |
| 17373 | 17374 | 2012-12-31 | Winter | 1 | December | 18 | 0 | Monday | 1 | Misty Cloudy | 0.26 | 0.2727 | 0.48 | 0.1343 | 10 | 112 | 122 | 1 | 12 | 1 | 2 |
| 17374 | 17375 | 2012-12-31 | Winter | 1 | December | 19 | 0 | Monday | 1 | Misty Cloudy | 0.26 | 0.2576 | 0.60 | 0.1642 | 11 | 108 | 119 | 1 | 12 | 1 | 2 |
| 17375 | 17376 | 2012-12-31 | Winter | 1 | December | 20 | 0 | Monday | 1 | Misty Cloudy | 0.26 | 0.2576 | 0.60 | 0.1642 | 8 | 81 | 89 | 1 | 12 | 1 | 2 |
| 17376 | 17377 | 2012-12-31 | Winter | 1 | December | 21 | 0 | Monday | 1 | Clear | 0.26 | 0.2576 | 0.60 | 0.1642 | 7 | 83 | 90 | 1 | 12 | 1 | 1 |
| 17377 | 17378 | 2012-12-31 | Winter | 1 | December | 22 | 0 | Monday | 1 | Clear | 0.26 | 0.2727 | 0.56 | 0.1343 | 13 | 48 | 61 | 1 | 12 | 1 | 1 |
| 17378 | 17379 | 2012-12-31 | Winter | 1 | December | 23 | 0 | Monday | 1 | Clear | 0.26 | 0.2727 | 0.65 | 0.1343 | 12 | 37 | 49 | 1 | 12 | 1 | 1 |